Design for an Optimal Probe

نویسنده

  • Michael O. Duff
چکیده

Given a Markov decision process (MDP) with expressed prior uncertainties in the process transition probabilities, we consider the problem of computing a policy that optimizes expected total (finite-horizon) reward. Implicitly, such a policy would effectively resolve the "exploration-versus-exploitation tradeoff" faced, for example, by an agent that seeks to optimize total reinforcement obtained over the entire duration of its interaction with an uncertain world. A Bayesian formulation leads to an associated MDP defined over a set of generalized process "hyperstates" whose cardinality grows exponentiaily with the planning horizon. Here we retain the full Bayesian framework, but sidestep intractability by applying techniques from reinforcement learning theory. We apply our resulting actor-critic algorithm to a problem of "optimal probing," in which the task is to identify unknown transition probabilities of an MDP using online experience.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and fabrication of a high-Q near-field probe for subsurface crack detection

Non-destructive detection and evaluation of invisible cracks in metal structures is an important matter in several critical environments including ground transportation, air transportation and power plants. In this paper, a high-Q near-field Microwave probe is designed and fabricated using defected ground structures for surface and subsurface crack detection in metal structures. For this purpos...

متن کامل

Hybrid Model for Bulk Current Injection Probe

A new hybrid-model for BCI probe is derived . This model is built based on the probe's internal structure without refinements, and by carrying out just one electrical measurement for the reflection coefficient, so that it can be generalized and used in studying the effect of layout parameters in the aim of improving the probe high frequency performance, which helps the developer in design stage...

متن کامل

An Efficient Bayesian Optimal Design for Logistic Model

Consider a Bayesian optimal design with many support points which poses the problem of collecting data with a few number of observations at each design point. Under such a scenario the asymptotic property of using Fisher information matrix for approximating the covariance matrix of posterior ML estimators might be doubtful. We suggest to use Bhattcharyya matrix in deriving the information matri...

متن کامل

Using Boehmite Nanoparticles as an Undercoat, and Riboflavin as a Redox Probe for Immunosensor Designing: Ultrasensitive Detection of Hepatitis C Virus Core Antigen

In this study a label-free electrochemical Immunosensor for ultrasensitive detection of Hepatitis C virus core antigen in serum samples was fabricated by using a simple approach. In this method a low-cost and sensitive immunosensor was fabricated based on a boehmite nanoparticles (BNPs) modified glassy carbon. The BNPs provide a specific platform with increased surface area which is capable of ...

متن کامل

Design, Evaluation and Prototyping of a New Robotic Mechanism for Ultrasound Imaging

This paper presents a new robotic mechanism for ultrasound imaging. The device is placed on a patient's body by an operator, and an ultrasound expert controls the motions of the device to obtain ultrasound images. The paper focuses on the robotic mechanism that performs ultrasound imaging. The design of the mechanism is based on two approaches to produce center of motion for an ultrasound probe...

متن کامل

PERFORMANCE BASED OPTIMAL SEISMIC DESIGN OF RC SHEAR WALLS INCORPORATING SOIL–STRUCTURE INTERACTION USING CSS ALGORITHM

In this article optimal design of shear walls is performed under seismic loading. For practical aims, a database of special shear walls is created. Special shear walls are used for seismic design optimization employing the charged system search algorithm as an optimizer. Constraints consist of design and performance limitations. Nonlinear behavior of the shear wall is taken into account and per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003